Page 1 of 6 
OIPE 



RAW SEQUENCE LISTING DATE: 07/26/2001 

PATENT APPLICATION: US/09/903,770 TIME: 15:19:35 

Input Set : A:\203979US.txt 
Output Set: N:\CRF3\07262001\I903770.raw 

3 <110> APPLICANT: MOLENAAR, DOUWE 

4 VAN DER REST, MICHEL E 

5 DRYSCH, ANDRE 
7 <120> TITLE OF INVENTION: NUCLEOTIDE SEQUENCE WHICH CODE FOR THE mdhA GENE 
9 <130> FILE REFERENCE: 203976US0X 

C--> 11 <140> CURRENT APPLICATION NUMBER: US/09/903,770 
C--> 11 <141> CURRENT FILING DATE: 2001-07-13 

11 <150> PRIOR APPLICATION NUMBER: DE 10032350.2 

12 <151> PRIOR FILING DATE: 2000-07-04 
14 <160> NUMBER OF SEQ ID NOS : 5 

16 <170> SOFTWARE: Patentln version 3.1 

18 <210> SEQ ID NO: 1 

19 <211> LENGTH: 18 

20 <212> TYPE: PRT 

21 <213> ORGANISM: Corynebacterium glutamicum 
23 <400> SEQUENCE: 1 

25 Asn Ser Pro Gin Asn Val Ser Thr Lys Lys Val Thr Val Thr Gly Ala 

26 1 5 10 15 
29 Gly Gin 

33 <210> SEQ ID NO: 2 

34 <211> LENGTH: 2663 

35 <212> TYPE: DNA 

36 <213> ORGANISM: Corynebacterium glutamicum 

38 <220> FEATURE: 

39 <221> NAME/KEY: CDS 

40 <222> LOCATION: ( 536 )..( 1519 ) 

41 <223> OTHER INFORMATION: 

44 <400> SEQUENCE: 2 

45 aaggcctttc ttatcgccaa agtgatagtg gatcatgcgc ttggacatgc cagatgcctt 60 
47 cgcgattttc tccaatttgg tttcgctaaa accatcctct gcaaaaaatg tcagagcggt 120 
49 ggctactacc tcttcagggg ttgcggtgtg tcctgaatca gattcaatga attcgctacc 180 
51 ggcctggtct atgttttcgg catctcgacg tgatgtcgcc ataatcgatc aattcctttc 240 
53 gggtaacgag aaaacgtgaa ttagaaacgg ggttaaggta aatatcaaag ataacaccat 300 
55 cggcaaatcc cagctgacaa ctataaatgg tgcccgatat caggaaaaat tgcttgcaca 360 
57 cgcgcgccga ttccccatga tgccctaaca tcttgcaggt gaggggtaca tattggggca 4 20 
59 attcgggggt aattttgcag tatcgtcaag atcacccaaa actggtggct gttctctttt 4 80 

61 aagcgggata gcatgggttc ttagaggacc ccctacaagg attgaggatt gttta atg 538 

62 Met 

63 1 

65 aat tec ccg cag aac gtc tec acc aag aag gtc acc gtc acc ggc gca 586 

66 Asn Ser Pro Gin Asn Val Ser Thr Lys Lys Val Thr Val Thr Gly Ala 

67 5 10 15 

69 get ggt caa ate tct tat tea ctg ttg tgg cgc ate gee aac ggt gaa 634 

70 Ala Gly Gin lie Ser Tyr Ser Leu Leu Trp Arg lie Ala Asn Gly Glu 

71 20 25 30 

73 gta ttc ggc acc gac acc cct gta gaa ctg aaa ctt ctg gag ate cct 682 

74 Val Phe Gly Thr Asp Thr Pro Val Glu Leu Lys Leu Leu Glu lie Pro 



ENTERED 
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75 35 40 45 

77 cag get ctt ggc ggg gca gag ggt gtg get atg gaa ctt ctg gat tct 730 

78 Gin Ala Leu Gly Gly Ala Glu Gly Val Ala Met Glu Leu Leu Asp Ser 

79 50 55 60 65 

81 gec ttc ccc etc ctg cga aac ate ace ate acc gcg gat gee aat gag 778 

82 Ala Phe Pro Leu Leu Arg Asn lie Thr lie Thr Ala Asp Ala Asn Glu 

83 70 75 80 

85 gca ttc gac ggc get aat gcg gcg ttt ttg gtc ggt gcg aag cct cgc 826 

86 Ala Phe Asp Gly Ala Asn Ala Ala Phe Leu Val Gly Ala Lys Pro Arg 

87 85 90 95 

89 gga aaa ggc gaa gag cgc gca gat ttg ctg get aac aac ggc aag att 874 

90 Gly Lys Gly Glu Glu Arg Ala Asp Leu Leu Ala Asn Asn Gly Lys lie 

91 100 105 110 

93 ttc gga cct caa ggt aaa get ate aat gac aac gee gca gat gac att 922 

94 Phe Gly Pro Gin Gly Lys Ala lie Asn Asp Asn Ala Ala Asp Asp lie 

95 115 120 125 

97 cgt gtc eta gtt gtt gga aac cca gcg aac acc aac gcg ttg att get 970 

98 Arg Val Leu Val Val Gly Asn Pro Ala Asn Thr Asn Ala Leu lie Ala 

99 130 135 140 145 



101 


tea 


get 


gcg 


gee 


cca 


gat 


gtt 


cca 


gca 


tec 


cgc 


ttc 


aac 


gca 


atg 


atg 


1018 


102 


Ser 


Ala 


Ala 


Ala 


Pro 


Asp 


Val 


Pro 


Ala 


Ser 


Arg 


Phe 


Asn 


Ala 


Met 


Met 




103 










150 










155 










160 






105 


cgc 


ctt 


gat 


cac 


aac 


cgt 


gcg 


ate 


tec 


cag 


ctg 


gee 


acc 


aag 


ctt 


ggc 


1066 


106 


Arg 


Leu 


Asp 


His 


Asn 


Arg 


Ala 


He 


Ser 


Gin 


Leu 


Ala 


Thr 


Lys 


Leu 


Gly 




107 








165 










170 










175 








109 


cgt 


gga 


tct 


gcg 


gaa 


ttt 


aac 


aac 


att 


gtg 


gtc 


tgg 


gga 


aat 


cac 


tec 


1114 


110 


Arg 


Gly 


Ser 


Ala 


Glu 


Phe 


Asn 


Asn 


He 


Val 


Val 


Trp 


Gly 


Asn 


His 


Ser 




111 






180 










185 










190 










113 


gca 


acc 


cag 


ttc 


cca 


gac 


ate 


acc 


tac 


gca 


acc 


gtt 


ggt 


gga 


gaa 


aag 


1162 


114 


Ala 


Thr 


Gin 


Phe 


Pro Asp 


He 


Thr 


Tyr 


Ala 


Thr 


Val 


Gly 


Gly 


Glu 


Lys 




115 




195 










200 










205 












117 


gtc 


act 


gac 


ctg 


gtt 


gat 


cac 


gat 


tgg 


tat 


gtg 


gag 


gag 


ttc 


att 


cct 


1210 


118 


Val 


Thr 


Asp 


Leu 


Val 


Asp 


His 


Asp 


Trp 


Tyr 


Val 


Glu 


Glu 


Phe 


He 


Pro 




119 


210 










215 










220 










225 




121 


cgc 


gtg 


get 


aac 


cgt 


ggc 


get 


gaa 


ate 


att 


gag 


gtc 


cgt 


gga 


aag 


tct 


1258 


122 


Arg 


Val 


Ala 


Asn 


Arg Gly 


Ala 


Glu 


He 


He 


Glu 


Val 


Arg 


Gly 


Lys 


Ser 




123 










230 










235 










240 






125 


tct 


gca 


get 


tct 


gca 


gca 


tec 


tct 


gcg 


att 


gat 


cac 


atg 


cgc 


gat 


tgg 


1306 


126 


Ser 


Ala 


Ala 


Ser 


Ala 


Ala 


Ser 


Ser 


Ala 


He 


Asp 


His 


Met 


Arg 


Asp 


Trp 




127 








245 










250 










255 








129 


gta 


cag 


ggc 


acc 


gag 


gcg 


tgg 


tec 


tct 


gcg 


gca 


att 


cct 


tec 


acc 


ggt 


1354 


130 


Val 


Gin 


Gly 


Thr 


Glu 


Ala 


Trp 


Ser 


Ser 


Ala 


Ala 


He 


Pro 


Ser 


Thr 


Gly 




131 






260 










265 










270 










133 


gca 


tac 


ggc 


att 


cct 


gag 


ggc 


att 


ttt 


gtc 


ggt 


ctg 


cca 


acc 


gta 


tec 


1402 


134 


Ala 


Tyr 


Gly 


He 


Pro 


Glu 


Gly 


He 


Phe 


Val 


Gly 


Leu 


Pro 


Thr 


Val 


Ser 




135 




275 










280 










285 












137 


cgc 


aac 


ggt 


gag 


tgg 


gaa 


ate 


gtt 


gaa 


ggc 


ctg 


gag 


att 


tec 


gat 


ttc 


1450 


138 


Arg 


Asn 


Gly 


Glu 


Trp 


Glu 


He 


Val 


Glu 


Gly 


Leu 


Glu 


He 


Ser 


Asp 


Phe 




139 


290 










295 










300 










305 
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141 cag cgc gcc cgc ate gac gcg aat get cag gaa ttg cag gee gag cgc 1498 

142 Gin Arg Ala Arg lie Asp Ala Asn Ala Gin Glu Leu Gin Ala Glu Arg 

143 310 315 320 

145 gag gca gtg cgc gac ttg etc taatctttaa egcatgaett cgettttcga 154 9 

146 Glu Ala Val Arg Asp Leu Leu 

147 325 

149 cgccccaacc ctccaacgcg tcaccgtttt cacgggctcg gcgctcggca gttcctcgct 1609 
151 gtacacgcaa gcggctcaaa ccttggcgaa aaccgeggta gaccgcggca tcgacttggt 1669 
153 ttacggtggc ggaaaagtgg ggctcatggg tategtcgeg gatgegttec tggaatcagg 1729 
155 tggegaagee tttggcgtca teaeggaate acttatgaag ggtgagcttg ggcatgaaaa 1789 
157 gctcaccgaa cttgaaatcg ttcctgatat gcacatccgc aagegtcgea tggcagaact 184 9 
159 tggcgatggt tttatcgeca tgcccggtgg cgccggcacc ttggaagaac ttttcgaggt 1909 
161 ctggacctgg caacagctgg gcattcatca aaagcccgtc gcactttatg atgtcgatgg 1969 
163 tttttggcag cccctgctgg aaatgcttga gcagatgacc cagcgtggat ttatcaagcg 2029 
165 agacttcttt gagtgectea tcgtggaatc cgacccgcat gccctgctaa aggcaatgea 2089 
167 gacctggact ccaccagcac caaaatggta actaaattgt gtgctcgacg gtaacgccgc 214 9 
169 cgagtatctt gatggaaatg gaagccacgc cgttgtcatt gactgtgatg gtttcttcta 2209 
171 cttctgggcc ategaaaegt gaaatctegg tagcatccac atcggtgatg gagctatcaa 2269 
173 aaggaatctt gatttcactg agcagggaaa tatctceggg gctgccatcc teggacaegg 2329 
175 tggagtattc cacgaacctg aaccaaccaa tgttgtgcac cgccttgtag categttteg 2389 
177 ccacggtcgc agaateggtg teeggggega teagegggtc aaagctcacg gcacgaccag 244 9 
179 aatcgtgctc aeggaacaca ccgatgcctc gcgcaacgcg gtcccttagg tggaaaccag 2509 
181 aggaagggtc agecgegatg gccagaccca ccgcagtgga acctgagggg aatggggagc 2569 
183 ggtggacacg gcggccgaaa cgctcgcgga gcaacctgga aacgagtggg agegaggate 2629 
185 cactagttct agageggecg ccaccgcggt ggag 2663 

188 <210> SEQ ID NO: 3 

189 <211> LENGTH: 328 

190 <212> TYPE: PRT 

191 <213> ORGANISM: Corynebacterium glutamicum 
193 <400> SEQUENCE: 3 

195 Met Asn Ser Pro Gin Asn Val Ser 

196 1 5 

199 Ala Ala Gly Gin lie Ser Tyr Ser 

200 20 

203 Glu Val Phe Gly Thr Asp Thr Pro 

204 35 40 45 

207 Pro Gin Ala Leu Gly Gly Ala Glu Gly Val Ala Met Glu Leu Leu Asp 

208 50 55 60 

211 Ser Ala Phe Pro Leu Leu Arg Asn lie Thr lie Thr Ala Asp Ala Asn 

212 65 70 75 • 80 

215 Glu Ala Phe Asp Gly Ala Asn Ala Ala Phe Leu Val Gly Ala Lys Pro 

216 85 90 95 



Thr 


L ys 


Lys 


Val 


Thr 


Val 


Thr 


Gly 




10 










15 




Leu 


Leu 


Trp 


Arg 


lie 


Ala 


Asn 


Gly 


25 










30 






Val 


Glu 


Leu 


Lys 


Leu 


Leu 


Glu 


He 



219 


Arg 


Gly 


Lys 


Gly 


Glu 


Glu 


Arg 


Ala 


Asp 


Leu 


Leu 


Ala 


Asn 


Asn 


Gly 


Lys 


220 








100 










105 










110 






223 


He 


Phe 


Gly 


Pro 


Gin 


Gly 


Lys 


Ala 


He 


Asn 


Asp 


Asn 


Ala 


Ala 


Asp 


Asp 


224 






115 










120 










125 








227 


He 


Arg 


Val 


Leu 


Val 


Val 


Gly 


Asn 


Pro 


Ala 


Asn 


Thr 


Asn 


Ala 


Leu 


He 


228 




130 










135 










140 










231 


Ala 


Ser 


Ala 


Ala 


Ala 


Pro 


Asp 


Val 


Pro 


Ala 


Ser 


Arg 


Phe 


Asn 


Ala 


Met 
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232 


145 








150 










155 










160 


235 


Met Arg 


Leu 


Asp 


His 


Asn 


Arg 


Ala 


He 


Ser 


Gin 


Leu 


Ala 


Thr 


Lys 


Leu 


236 








165 










170 










175 




239 


Gly Arg Gly 


Ser 


Ala 


Glu 


Phe 


Asn 


Asn 


He 


Val 


Val 


Trp Gly 


Asn 


His 


240 






180 










185 










190 






243 


Ser Ala 


Thr 


Gin 


Phe 


Pro 


Asp 


He 


Thr 


Tyr 


Ala 


Thr 


Val 


Gly 


Gly 


Glu 


244 




195 










200 










205 








247 


Lys Val 


Thr 


Asp 


Leu 


Val 


Asp 


His 


Asp 


Trp 


Tyr 


Val 


Glu 


Glu 


Phe 


He 


248 


210 










215 










220 










251 


Pro Arg 


Val 


Ala 


Asn 


Arg 


Gly 


Ala 


Glu 


He 


He 


Glu 


Val 


Arg 


Gly 


Lys 


252 


225 








230 










235 










240 


255 


Ser Ser 


Ala 


Ala 


Ser 


Ala 


Ala 


Ser 


Ser 


Ala 


He 


Asp 


His 


Met 


Arg 


Asp 


256 








245 










250 










255 




259 


Trp Val 


Gin 


Gly 


Thr 


Glu 


Ala 


Trp 


Ser 


Ser 


Ala 


Ala 


He 


Pro 


Ser 


Thr 


260 






260 










265 










270 






263 


Gly Ala 


Tyr Gly 


He 


Pro 


Glu 


Gly 


He 


Phe 


Val 


Gly 


Leu 


Pro 


Thr 


Val 


264 




275 










280 










285 








267 


Ser Arg 


Asn 


Gly Glu 


Trp 


Glu 


He 


Val 


Glu 


Gly 


Leu 


Glu 


He 


Ser 


Asp 


268 


290 










295 










300 










271 


Phe Gin 


Arg 


Ala 


Arg 


He 


Asp 


Ala 


Asn 


Ala 


Gin 


Glu 


Leu 


Gin 


Ala 


Glu 


272 


305 








310 










315 










320 


275 


Arg Glu 


Ala 


Val 


Arg 


Asp 


Leu 


Leu 



















276 325 

279 <210> SEQ ID NO: 4 

280 <211> LENGTH: 18 

281 <212> TYPE: DNA 

282 <213> ORGANISM: Artificial Sequence 

284 <220> FEATURE: 

285 <223> OTHER INFORMATION: synthetic DNA 

287 <400> SEQUENCE: 4 

288 aargtyacyg tyacyggy 18 

291 <210> SEQ ID NO: 5 

292 <211> LENGTH: 17 

293 <212> TYPE: DNA 

294 <213> ORGANISM: Artificial Sequence 

296 <220> FEATURE: 

297 <223> OTHER INFORMATION: synthetic DNA 

299 <400> SEQUENCE: 5 

300 cgrttrtgrt cvarrcg 17 
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VERIFICATION SUMMARY DATE: 07/26/2001 

PATENT APPLICATION: US/09/903 f 770 TIME: 15:19:36 
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L:ll M:270 C: Current Application Number differs, Replaced Current Application No 
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